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Abstract 

In a column-restricted covering integer program (CCIP), all the non-zero entries of any 
column of the constraint matrix are equal. Such programs capture capacitated versions of 
covering problems. In this paper, we study the approximability of CCIPs, in particular, their 
relation to the integrality gaps of the underlying 0,1-CIP. 

If the underlying 0,1-CIP has an integrality gap 0(7), and assuming that the integral- 
ity gap of the priority version of the 0,1-CIP is 0{lo), we give a factor 0{''y+uj) approximation 
algorithm for the CCIP. Priority versions of 0,1-CIPs (PCIPs) naturally capture quality of 
service type constraints in a covering problem. 

We investigate priority versions of the line (PLC) and the (rooted) tree cover (PTC) 
problems. Apart from being natural objects to study, these problems fall in a class of 
fundamental geometric covering problems. We bound the integrality of certain classes of 
this PCIP by a constant. Algorithmically, we give a polytime exact algorithm for PLC, show 
that the PTC problem is APX-hard, and give a factor 2-approximation algorithm for it. 

1 Introduction 

In a 0,1-covering integer program (0,1-CIP, in short), we are given a constraint matrix A G 
{0, l}"*'^"-^ demands h € 17^, non-negative costs c G , and upper bounds d G , and the goal 
is to solve the following integer linear program (which we denote by Cov(^, 6, c, d)). 

minjc^x : Ax > b,0 < x < d,x integer}. 

Problems that can be expressed as 0,1-CIPs are essentially equivalent to set multi-cover 
problems, where sets correspond to columns and elements correspond to rows. This directly 
implies that 0,1-CIPs are rather well understood in terms of approximability: the class admits 
efficient O(logn) approximation algorithms and this is best possible unless NP = P. Neverthe- 
less, in many cases one can get better approximations by exploiting the structure of matrix A. 
For example, it is well known that whenever A is totally unimodular (TU)(e.g., see [19]), the 
canonical LP relaxation of a 0,1-CIP is integral; hence, the existence of efficient algorithms for 
solving linear programs immediately yields fast exact algorithms for such 0,1-CIPs as well. 

While a number of general techniques have been developed for obtaining improved approx- 
imation algorithms for structured 0, 1-CIPs, not much is known for structured non-0, 1 CIP 
instances. In this paper, we attempt to mitigate this problem, by studying the class of column- 
restricted covering integer programs (CCIPs), where all the non-zero entries of any column of 
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the constraint matrix are equal. Such CIPs arise naturally out of 0, 1-CIPs, and the main focus 
of this paper is to understand how the structure of the underlying 0,1-CIP can be used to derive 
improved approximation algorithms for CCIPs. 

Column-Restricted Covering IPs (CCIPs): Given a 0,1-covering problem Cov{A,b,c,d) 
and a supply vector s € ZIJ:, the corresponding CCIP is obtained as follows. Let A[s] be the 
matrix obtained by replacing all the I's in the jth column by sj; that is, A[s]jj = AijSj for all 
1 < i < m,l < j < n. The column-restricted covering problem is given by the following integer 
program. 

min{c"^x : A[s]x > b,0 < x < d,x integer}. {Cov{A[s],b,c,d)) 

CCIPs naturally capture capacitated versions of 0,1-covering problems. To illustrate this we 
use the following 0,1-covering problem called the tree covering problem. The input is a tree 
T = (y,E) rooted at a vertex r £ V, a set of segments S C {{u,v) : u is a child of v}, non- 
negative costs Cj for all j G S, and demands be £ Z+ for all e £ E. An edge e is contained in 
a segment j = (n, v) if e lies on the unique u, f-path in T. The goal is to find a minimum-cost 
subset C of segments such that each edge e € E is contained in at least be segments of C. When 
T is just a line, we call the above problem, the line cover (LC) problem. In this example, the 
constraint matrix A has a row for each edge of the tree and a column for each segment in S. It is 
not too hard to show that this matrix is TU and thus these can be solved exactly in polynomial 
time. 

In the above tree cover problem, suppose each segment j G S also has a capacity supply sj 
associated with it, and call an edge e covered by a collection of segments C iff the total supply of 
the segments containing e exceeds the demand of e. The problem of finding the minimum cost 
subset of segments covering every edge is precisely the column-restricted tree cover problem. 
The column-restricted line cover problem encodes the minimum knapsack problem and is thus 
NP-hard. 

For general CIPs, the best known approximation algorithm, due to Kolliopoulos and Young [16| . 
has a performance guarantee of O(l-l-loga), where a, called the dilation of the instance, denotes 
the maximum number of non-zero entries in any column of the constraint matrix. Nothing better 
is known for the special case of CCIPs unless one aims for bicriteria results where solutions violate 
the upper bound constraints x < d (see Section 11.11 for more details) . 

In this paper, our main aim is to understand how the approximability of a given CCIP 
instance is determined by the structure of the underlying 0, 1-CIP. In particular, if a 0, 1-CIP has 
a constant integrality gap, under what circumstances can one get constant factor approximation 
for the corresponding CCIP? We make some steps toward finding an answer to this question. 

In our main result, we show that there is a constant factor approximation algorithm for 
CCIP if two induced 0, 1-CIPs have constant integrality gap. The first is the underlying origi- 
nal 0,1-CIP. The second is a priority version of the 0,1-CIP (PCIP, in short), whose constraint 
matrix is derived from that of the 0,1-CIP as follows. 

Priority versions of Covering IPs (PCIPs): Given a 0,1-covering problem Cov(^, 5, c, d), 
a priority supply vector s G Zi, and a priority demand vector vr G ZT, the corresponding PCIP 
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is as follows. Define ^[s,7r] to be the following 0,1 matrix 

A[s,^Uj - I : otherwise, 

Thus, a column j covers row i, only if its priority supply is higher than the priority demand of 
row i. The priority covering problem is now as follows. 

minjc^x : ^[s,7r]x > 1,0 < a; < d, x integer}. (Cov(^[s, tt], 1, c)) 

We believe that priority covering problems are interesting in their own right, and they arise 
quite naturally in covering applications where one wants to model quality of service (QoS) or 
priority restrictions. For instance, in the tree cover problem defined above, suppose each segment 
j has a quality of service (QoS) or priority supply Sj associated with it and suppose each edge 
e has a QoS or priority demand vrg associated with it. We say that a segment j covers e iff j 
contains e and the priority supply of j exceeds the priority demand of e. The goal is to find a 
minimum cost subset of segments that covers every edge. This is the priority tree cover problem. 

Besides being a natural covering problem to study, we show that the priority tree cover 
problem is a special case of a classical geometric covering problem: that of finding a minimum 
cost cover of points by axis-parallel rectangles in 3 dimensions. Finding a constant factor 
approximation algorithm for this problem, even when the rectangles have uniform cost, is a long 
standing open problem. 

We show that although the tree cover is polynomial time solvable, the priority tree cover 
problem is APX-hard. We complement this with a factor 2 approximation for the problem. 
Furthermore, we present constant upper bounds for the integrality gap of this PCIP in a number 
of special cases, implying constant upper bounds on the corresponding CCIPs in these special 
cases. We refer the reader to Section 11.21 for a formal statement of our results, which we give 
after summarizing works related to our paper. 

1.1 Related work 

There is a rich and long line of work ([10] [T2l [T8| [20l I21j ) on approximation algorithms for 
CIPs, of which we state the most relevant to our work. Assuming no upper bounds on the 
variables, Srinivasan [20] gave a 0(1 + log a)-approximation to the problem (where a is the 
dilation as before). Later on, Kolliopoulos and Young [16] obtained the same approximation 
factor, respecting the upper bounds. However, these algorithms didn't give any better results 
when special structure of the constraint matrix was known. On the hardness side, Trevisan |22j 
showed that it is NP-hard to obtain a (log a — 0(log log a))-approximation algorithm even for 
0,1-CIPs. 

The most relevant work to this paper is that of Kolliopoulos [13]. The author studies CCIPs 
which satisfy a rather strong assumption, called the no bottleneck assumption, that the supply 
of any column is smaller than the demand of any row. Kolliopoulos [13] shows that if one is 
allowed to violate the upper bounds by a multiplicative constant, then the integrality gap of the 
CCIP is within a constant factor of that of the orig inal 0,1-CIf|1]. As the author notes such a 
violation is necessary; otherwise the CCIP has unbounded integrality gap. If one is not allowed 
to violated upper bounds, nothing better than the result of [16j is known for the special case of 
CCIPs. 

^Such a result is implicit in the paper; the author only states a O(loga) integrality gap. 
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Our work on CCIPs parallels a large body of work on column-restricted packing integer pro- 
grams (CPIPs). Assuming the no-bottleneck assumption, Kolliopoulos and Stein |15] show that 
CPIPs can be approximated asymptotically as well as the corresponding 0,1-PIPs. Chekuri et 
al. [7] subsequently improve the constants in the result from [15]. These results imply constant 
factor approximations for the column-restricted tree packing problem under the no-bottleneck 
assumption. Without the no-bottleneck assumption, however, only polylogarithmic approxima- 
tion is known for the problem [6]. 

The only work on priority versions of covering problems that we are aware of is due to 
Charikar, Naor and Schieber [5] who studied the priority Steiner tree and forest problems in 
the context of QoS management in a network multicasting application. Charikar et al. present 
a 0(log n)-approximation algorithm for the problem, and Chuzhoy et al. [9] later show that no 
efficient o(loglogn) approximation algorithm can exist unless NP C DTIME(n^°s'°s^°s"') (n is 
the number of vertices). 

To the best of our knowledge, the column-restricted or priority versions of the line and tree 
cover problem have not been studied. The best known approximation algorithm known for both 
is the O(logn) factor implied by the results of [16] stated above. However, upon completion 
of our work, Nitish Korula [17J pointed out to us that a 4- approximation for column-restricted 
line cover is implicit in a result of Bar-Noy et al. [2]. We remark that their algorithm is not 
LP-based, although our general result on CCIPs is. 

1.2 Technical Contributions and Formal Statement of Results 

Given a 0,1-CIP Cov{A, b, c, d), we obtain its canonical LP relaxation by removing the integrality 
constraint. The integrality gap of the CIP is defined as the supremum of the ratio of optimal 
IP value to optimal LP value, taken over all non-negative integral vectors b, c, and d. The in- 
tegrality gap of an IP captures how much the integrality constraint affects the optimum, and is 
an indicator of the strength of a linear programming formulation. 

CCIPs: Suppose the CCIP is Cov(A[s], 6, c, d). We make the following two assumptions about 
the integrality gaps of the 0,1 covering programs, both the original 0,1-CIP and the priority 
version of the 0,1-CIP. 

Assumption 1. The integrality gap of the original 0,1-CIP is 7 > 1. Specifically, for any non- 
negative integral vectors b, c, and d, if the canonical LP relaxation to the CIP has a fractional 
solution X, then one can find in polynomial time an integral feasible solution to the CIP of cost 
at most 7 • (^x . We stress here that the entries of b, c, d could be as well as 00 . 

Assumption 2. The integrality gap of the PCIP is lo > 1. Specifically, for any non-negative 
integral vectors s,tt,c, if the canonical LP relaxation to the PCIP has a fractional solution x, 
then one can find in polynomial time, an integral feasible solution to the PCIP of cost at most 

T 

UJ ■ c X. 

We give an LP-based approximation algorithm for solving CCIPs. Since the canonical LP 
relaxation of a CCIP can have unbounded integrality gap, we strengthen it by adding a set 
of valid constraints called the knapsack cover constraints. We show that the integrality gap of 
this strengthened LP is 0(7 -|- to), and can be used to give a polynomial time approximation 
algorithm. 
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Theorem 1. Under Assumptions {1\ andl^ there is a {24:'y + 8u})- approximation algorithm for 
column-restricted CIPs. 

Knapsack cover constraints to strengthen LP relaxations were introduced in [T|llH[23j: Carr 
et al. [1] were the first to employ them in the design approximation algorithms. The paper of 
Kolliopoulos and Young [16] also use these to get their result on general CIPs. 

The main technique in the design of algorithms for column-restricted problems is grouping- 
and-scaling developed by Kolliopoulos and Stein [T5] for packing problems, and later used 
by Kolliopoulos [13] in the covering context. In this technique, the columns of the matrix are 
divided into groups of 'close-by' supply values; in a single group, the supply values are then 
scaled to be the same; for a single group, the integrality gap of the original 0,1-CIP is invoked 
to get an integral solution for that group; the final solution is a 'union' of the solutions over all 
groups. 

There are two issues in applying the technique to the new strengthened LP relaxation of our 
problem. Firstly, although the original constraint matrix is column-restricted, the new constraint 
matrix with the knapsack cover constraints is not. Secondly, unless additional assumptions are 
made, the current grouping-and-scaling analysis doesn't give a handle on the degree of violation 
of the upper bound constraints. This is the reason why Kolliopoulos [13] needs the strong 
no-bottleneck assumption. 

We get around the first difficulty by grouping the rows as well, into those that get most 
of their coverage from columns not affected by the knapsack constraints, and the remainder. 
On the first group of rows, we apply a subtle modification to the vanilla grouping-and-scaling 
analysis and obtain a 0(7) approximate feasible solution satisfying these rows; we then show 
that one can treat the remainder of the rows as a PCIP and get a 0{u) approximate feasible 
solution satisfying them, using Assumption 2. Combining the two gives the 0(7 + 0;) factor. 
The full details are given in Section 2. 

We stress here that apart from the integrality gap assumptions on the 0,1-CIPs, we do not 
make any other assumption (like the no-bottleneck assumption). In fact, we can use the modified 
analysis of the grouping-and-scaling technique to get a similar result as [13j for approximating 
CCIPs violating the upper-bound constraints, under a w;ea/cer assumption than the no-bottleneck 
assumption. The no-bottleneck assumption states that the supply of any column is less than the 
demand of any row. In particular, even though a column has entry on a certain row, its supply 
needs to be less than the demand of that row. We show that if we weaken the no-bottleneck 
assumption to assuming that the supply of a column j is less than the demand of any row i only 
if is positive, a similar result can be obtained via our modified analysis. 

Theorem 2. Under assumption {1\ and assuming AijSj < bi, for all i,j, given a fractional 
solution X to the canonical LP relaxation of Cov{A[s],b,c,d), one can find an integral solution 
x^^^ whose cost c • x^"* < 107(0 • x) and x^^^ < Wd. 

Priority Covering Problems. In the following, we use PLC and PTC to refer to the priority 
versions of the line cover and tree cover problems, respectively. Recall that the constraint 
matrices for line and tree cover problems are totally unimodular, and the integrality of the 
corresponding 0,1-covering problems is therefore 1 in both case. It is interesting to note that the 
0,1-coefficient matrices for PLC and PTC are not totally unimodular in general. The following 
integrality gap bound is obtained via a primal-dual algorithm. 
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Theorem 3. The canonical LP for priority line cover has an integrality gap of at least 3/2 and 
at most 2. 

In the case of tree cover, we obtain constant upper bounds on the integrahty gap for the 
case c = 1, that is, for the minimum cardinahty version of the problem. We beheve that the 
PCIP for the tree cover problem with general costs also has a constant integrality gap. On the 
negative side, we can show an integrality gap of at least 

Theorem 4. The canonical LP for unweighted PTC has an integrality gap of at most 6. 

We obtain the upper bound by taking a given PTC instance and a fractional solution to 
its canonical LP, and decomposing it into a collection of PLC instances with corresponding 
fractional solutions, with the following two properties. First, the total cost of the fractional 
solutions of the PLC instances is within a constant of the cost of the fractional solution of the 
PTC instance. Second, union of integral solutions to the PLC instances gives an integral solution 
to the PTC instance. The upper bound follows from Theorem [3l Using Theorem [H we get the 
following as an immediate corollary. 

Corollary 1. There are 0{1)- approximation algorithms for column-restricted line cover and the 
cardinality version of the column-restricted tree cover. 

We also obtain the following combinatorial results. 

Theorem 5. There is a polynomial-time exact algorithm for PLC. 

Theorem 6. PTC is APX-hard, even when all the costs are unit. 

Theorem 7. There is an efficient 2- approximation algorithm for PTC. 

The algorithm for PLC is a non-trivial dynamic programming approach that makes use of 
various structural observations about the optimal solution. The approximation algorithm for 
PTC is obtained via a similar decomposition used to prove Theorem [H 

We end by noting some interesting connections between the priority tree covering prob- 
lem and set covering problems in computational geometry. The rectangle cover problem in 
3-dimensions is the following: given a collection of points P in R^, and a collection C of axis- 
parallel rectangles with costs, find a minimum cost collection of rectangles that covers every 
point. We believe studying the PTC problem could give new insights into the rectangle cover 
problem. 

Theorem 8. The priority tree covering problem is a special case of the rectangle cover problem 
in dimensions. 

2 General Framework for Column Restricted CIPs 

In this section we prove Theorem [TJ Our goal is to round a solution to a LP relaxation of 
Cov(A[s], 6, c, d) into an approximate integral solution. We strengthen the following canonical 
LP relaxation of the CCIP 

minjc^x : A[s\x > 6, < x < d, x > 0} 

by adding valid knapsack cover constraints. In the following we use C for the set of columns and 
TZ for the set of rows of A. 
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2.1 Strengthening the canonical LP Relaxation 

Let F C C be a subset of the columns in the column restricted CIP Cov{A[s],b,c,d). For all 
rows i € TZ, define hf = max{0,6i — X^jeF ^Hu'^j} residual demand of row i w.r.t. 

F. Define matrix ^^[s] by letting 

aF^. _ / min{A[s]ij,6f} : jeC\F , 
^ i^J^i - \ : J € F, 

for alH € C and for all j € TZ. The following Knapsack- Cover (KC) inequality 

iec 

is valid for the set of all integer solutions x for Cov{A[s\,b,c,d). Adding the set of all KC 
inequalities yields the following stronger LP formulation CIP. We note that the LP is not column- 
restricted, in that, different values appear on the same column of the new constraint matrix. 

optp := min CjXj (P) 

s.t. ^ A''[s\ijXj >bf yFCC,yie-R (3) 

iec 

0<Xj < dj Vj G C 

It is not known whether ([Pj can be solved in polynomial time. For a G (0, 1), call a vector x* 
a-relaxed if its cost is at most optp, and if it satisfies ([3]) for F = {j £ C : x* > adj}. An 
a-relaxed solution to ([P]) can be computed efficiently for any a. To see this note that one can 
check whether a candidate solution satisfies ([3]) for a set F; we are done if it does, and otherwise 
we have found an inequality of ([P]) that is violated, and we can make progress via the ellipsoid 
method. Details can be found in [4J and [16j. 

We fix an a G (0,1), specifying its precise value later. Compute an a-relaxed solution, x*, 
for (jP|), and let F = {j £ C : x* > adj}. Define x as, Xj = x* if j G C \ F, and Xj = 0, 
otherwise. Since x* is an a-relaxed solution, we get that x is a feasible fractional solution to the 
residual CIP, Cov(yl^[s], , c, ad). In the next subsection, our goal will be to obtain an integral 
feasible solution to the covering problem Cov{A^[s],b^ ,c,d) using x. The next lemma shows 
how this implies an approximation to our original CIP. 

Lemma 1. // there exists an integral feasible solution, x^^^ , to Cov{A^[s], b^ , c, d) with c'^x^"* < 
/3 • c^x, then there exists a max{l/a, /3}-/actor approximation to Cov{A[s],b,c,d). 

Proof. Define 

d, : jeF 
x)'^* : j £C\F, 

Observe that z < d. z is a feasible integral solution to Cov(^[s], b, c, d) since for any i £ TZ, 
^A[s]ijZj = ^A[s]ijdj+ Yl A[.s]ijx)^' >{h-bf)+ Yl A^[s]ij^r>bi 

jeC jdF j(^C\F j(iC\F 
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where the first inequahty follows from the definition of bf and since > A^[s]ij, the second 

inequality follows since x^^^ is a feasible solution to Cov{A^[s],b^ ,c,d). 

Furthermore, 

c^z = Cjdj + CjX^'^* < — CjX* + /3 CjX* < max{— , /3}optp 
jeF jeC\F ^ jdF jeC\F " 

where the first inequality follows from the definition of F and the second from the assumption 
in the theorem statement. □ 

2.2 Solving the Residual Problem 

In this section we use a feasible fractional solution x of Cov {A^ [s], ,c, ad), to obtain an inte- 
gral feasible solution to the covering problem Cov{A^[s],b^ ,c,d), with c^x^^^ < ^c^x for 
^ = 247 + 8lo. Fix a = 1/24. 

Converting to Powers of 2. For ease of exposition, we first modify the input to the residual 
problem Cov{A^[s], b^ , c, d) so that all entries of are powers of 2. For every i £TZ, let bi denote 
the smallest power of 2 larger than bf . For every column j ^ C, let Sj denote the largest power 
of 2 smaller than Sj. 

Lemma 2. y = 4x is feasible for Cov{A^[s],b, c, 4ad). 
Proof. Focus on row i £ TZ. We have 

A^[s]ijyj > 2 • ^ A^[s]ijXj > 2bf > h, 

where the first inequality uses the fact that sj < 2s j for all j € C, the second inequality uses 
the fact that x is feasible for Cov{A^[s],b^ ,c,ad), and the third follows from the definition of 
bi. □ 

Partitioning the rows. We call bi the residual demand of row i. For a row i, a column 
j € C is i-large if the supply of j is at least the residual demand of row i] it is i-small otherwise. 
Formally, 

Ci = {j G C : Aij = l,Sj >bi} is the set of i-large columns 
Si = {j G C : Aij = l,Sj <bi} is the set of z-small columns 

Recah the definition from ([2]), A^[s]jj = mm.{A[s]ij,bf). Therefore, A^[s]ij = Aijbf for all 
j G Ci since Sj >bi>bf; and A^[s]ij = Aijsj for all j G Si, since being powers of 2, sj < bi 
implies, sj <bi/2 < bf . 

We now partition the rows into large and small depending on which columns most of their 
coverage comes from. Formally, call a row i ^TZ large if 

and small otherwise. Note that Lemma [2] together with the fact that each column in row i's 
support is either small or large implies. 

For a large row i, A^[s]ijyj > bi/2, For a small row i, A^[s]ijyj >bi/2 
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Let TZl and TZs be the set of large and small rows. 

In the following, we address small and large rows separately. We compute a pair of integral 
solutions x^'^*''^ and x^^^''~' that are feasible for the small and large rows, respectively. We then 
obtain x^^^ by letting 

int r int,t5 intXi /c-\ 

Xj = maxjx^- ' ,Xj ' }, (5) 

for all j G C. 

2.2.1 Small rows. 

For these rows we use the grouping-and-scaling technique a la [71 [131 [IS [13- However, as 
mentioned in the introduction, we use a modified analysis that bypasses the no-bottleneck 
assumptions made by earlier works. 

Lemma 3. We can find an integral solution x^^^'^ such that 
a) x^"*''^ < dj for all j, 

jgc CjXj, and 

c) for every small row i G TZs, Z^jeC — bf ■ 

Proof. The complete proof is slightly technical and hence we start with a sketch. Since the 
rows are small, for any row i, we can zero out the entries that are larger than 6j, and still 2y will 
be a feasible solution. Note that, now in each row, the entries are < bi, and thus are at most bi/2 
(everything being powers of 2). We stress that it could be that bi of some row is less than the 
entry in some other row, that is, we don't have the no-bottleneck assumption. However, when 
a particular row i is fixed, bi is at least any entry of the matrix in the ith row. Our modified 
analysis of grouping and scaling then makes the proof go through. 

We group the columns into classes that have sj as the same power of 2, and for each row i we 

let bf^ be the contribution of the class t columns towards the demand of row i. The columns of 
class t, the small rows, and the demands bf^ form a CIP where all non-zero entries of the matrix 

are the same power of 2. We scale both the constraint matrix and bf^ down by that power of 2 
to get a 0,1-CIP, and using assumption 1, we get an integral solution to this 0,1-CIP. Our final 
integral solution is obtained by concatenating all these integral solutions over all classes. 

Till now the algorithm is the standard grouping-and-scaling algorithm. The difference lies 
in our analysis in proving that this integral solution is feasible for the original CCIP. Originally 
the no-bottleneck assumption was used to prove this. However, we show since the column values 
in different classes are geometrically decreasing, the weaker assumption of bi being at least any 
entry in the ith row is enough to make the analysis go through. We now get into the full proof. 

Step 1: Grouping the columns. Let Smin and Smax be the smallest and largest supply 
among the columns in C \ F. Since all Sj are powers of 2, we introduce the shorthand, s*-*^ for 
the supply Sinax/2*. We say that a column j is in class t > 0, if sj = s^*\ and we let 

■={jeC\F : Sj- =sW} 

be the set of class t supplies. 
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Step 2: Disregarding z-large columns of a small row i. Fix a small row i S TZs- We now 

identify the columns j that are z-small. To do so, define ti := log{smax/bi) + 1. Observe that any 
column j in class C*-*^ for t >ti are i-small. This is because sj = Smax/'^^ < Smaxf^^^ = bi/2 < bi. 
Define 

lit) ^ I 2^^.gc(t) : t>ti 

* \ : otherwise 

as the contribution of the class t, i-small columns to the demand of row multiplied by 2. Note 
that by definition of small rows, these columns contribute to more than 1/2 of the demand, and 
so 

Y.~^^^h. (6) 

t>ti 

Henceforth, we will consider only the contributions of the small i-columns of a small row i. 

Step 3: Scaling and getting the integral solution. Fix a class t of columns and scale 
down by s*-*^ to get a {0, l}-constraint matrix. (Recall entries of the columns in a class t are 
all s^^\) This will enable us to apply assumption 1 and get a integral solution corresponding 
to these columns. The final integral solution will be the concatenation of the integral solutions 
over the various classes. 

The constants in the next claim are carefully chosen for the calculations to work out later. 

Claim 1. For any t>0 and for all iens,6- EjecW ^ijVj ^ [3^? Vs^*^J • 

Proof. The claim is trivially true for rows i with ti > t as hf^ = in this case. Consider a row 
i with ti < t. Since any column j G C*^*^ is i-small, we get = Aijsj = AijS^^\ Using the 

definition of bi, we obtain 

Dividing both sides by s^*) and taking the floor on the right-hand side yields the claim. □ 

Since a = 1/24 and x is a feasible solution to Cov{A^[s],b^ ,c,d/24), we get that Gyj = 
24 -Xj < dj for all j C\F. Thus, the above claim shows that 6y is a feasible fractional solution 
for Cov(A^^\ [36^*^/s^*^J , c*-*^ , d^*^), where A^^^ is the submatrix of A defined by the columns in 
C(*) , and c(*) and d^*) are the sub- vectors of c and d, respectively, that are induced by C^^\ Using 
Assumption [H we therefore conclude that there is an integral vector x^"*'*^'* such that 



in.t,S,t ^ ^ j ^ C^t) ^ 



36 



it) 



for all i £ TZs, and 



c^x. < 67- 2^ Cjyj 



(7) 
(8) 

(9) 



We obtain integral solution x^'^*"'^ by letting x^^'^'^ = x^"*'*^'* if j G Thus x^"*'"^ < dj for 
all j £ C, and we get, 



E 



CjXj 



int,tS 



Yl Yl ^i^T'^'' < 67 • 2^ Yl ^i^j- = 247 • ' 

t>o jec^t) t>o jfzcw jec 



(10) 



10 



Thus we have estabUshed parts (a) and (b) of the lemma. It remains to show that 
feasible for the set of small rows. 

Step 4: Putting them all together: scaling back. Once again, fix a small row i E TZs- 
The following inequality takes only contribution of the i-small columns. We later show this 
suffices. 

j^C: j is i-small 

= E E ^^^-r' ^ E E ^s^-r'' (11) 

t>ti j^c^t) t>ti j(zcW 

The first inequality follows since = AijSj for z-small columns, the equality follows from 

the definition of tj, and the final inequality uses the fact that Sj > s^*-* for j G C^^\ The following 
claim along with (llip proves feasibility of row i. This is the part where our analysis slightly 
differs from the standard grouping-and-scaling analysis. 

Claim 2. For any small row i G TZs, 

E E A^s^^-r'^'^bf. 

Proof. In this proof, the choice of the constant 3 on the right-hand side of the inequality in 
Claim [1] will become clear. Let Si = {t > ti : 3bf^ < s^*)} be the set of z-small classes t whose 
fractional supply bf^ is small compared to its integral supply s^*^. We now show that for any 
small row i, the columns in the classes not in Si suffice to satisfy its demand. Note that 

E ^ = E^-E^>E^-lE^'' (12) 

t^Si,t>u t>ti teSi t>ti teSi 
which follows from the definition of Si. Furthermore, from ([6]) we know that for a small row, 
St>ti bf^ — bi- Also, since s^*^ form a geometric series, we get that "^^it&s^ — St>tj — 
2s(*'). Putting this in (|12p we get 

E ~^^>b^-\Y.-^^''>h-\s^'^' = \b. (13) 
t^Si,t>U t>ti 

where the final equality follows from the definition of ti which implies that s^*'-* = bi/2. 

Moreover, for t ^ Si, we know that [Sbj/s'^^^] > |6*/s(*) since [aj > a/2 if a > 1. Therefore, 
using inequality ([8]) in (fTTj) . we get 

3b^ 



> bi>bf, 



where the second-last inequality uses (|13p . and the last uses the definition of 6j. This completes 
the proof of the lemma. □ 

□ 



11 



2.2.2 Large rows. 

The large rows can be showed to be a PCIP problem and thus Assumption 2 can be invoked to 
get an analogous lemma to Lemma [3j 



Lemma 4. We can find an integral solution x^"*''^ such that 

it 



a) x^'^*''^ < 1 for all j , 



c) for every large row i G TZl, X]jgc W«i^j°*''^ — • 
Proof. Let i € TZl be a large row, and recall that Ci is the set of i-large columns in C. We have 

and hence 

Let A'^ be the minor of A induced by the large rows. Consider the priority cover problem 
Cov(^^[s, b], 1, c). Prom the definition of £j, it follows 2y is a feasible fractional solution to the 
priority cover problem. 

Using Assumption[2l we conclude that there is an integral solution x^^*'-^ such that Ylj^c '^J^T^^ 

2a; E jGC '^jUj — ^^Y^jec ^3^3^ ^'^^ ^jec > I7 for ah large rows i gTZl- 

Fix a large row i. Since A^[s]ij = bf for all i-large columns Ci, we get 

This completes the proof of the lemma. □ 

Proof of Theorem [1] Let x^^*'*^ and x^^*'"^ be as satisfying the conditions of Lemma [3] and HI 
respectively. Define x^'^* as = maxjxj"*'*^, x^^*'"^}. We have 

a) x^"* < dj since both x^'^*''^ < dj and x^^'^^ <^1^dj. 

b) For any row i, X^jgc ^'^[■^lij^;)"* > &f since the inequality is true with x^'^* replaced by x^'^*''^ 
for small rows, and x^"* by x^^*'-^ for large rows. 

int,(S I int,/Z 



c) Ej6C Cj^^f * < Ejec (^3^T ' + Sjec Cja;^ ' < (247 + 8uj) Y^jec '^3^3 ■ 



< 



Thus, x^^* is a feasible integral solution to Cov(A [s], 6 , c, d) with cost bounded as Ylj^c '^j^Y 

gC '^i^i- Noting that a — 1/24, the proof of the theorem follows from Lemma [TJ 

□. 

2.3 CCIPs with violation of upper-bounds: Proof of Theorem [2] 

In this section we prove Theorem [2] that we restate here. In the proof, we will indicate how we 
modify the analysis of grouping-and-scaling that allows us to replace the no-bottleneck assump- 
tion with a weaker one. 
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Theorem 9. (Theoreml^ Under assumption^ and assuming AijSj < hi, for all given a 
fractional solution x to the canonical LP relaxation of Cov{A[s],b, c, d) , one can find an integral 
solution x^"* whose cost c ■ x^"* < 107(c • x) and x^^^ < Wd. 

Proof. Let a; be a feasible solution to 74[s]x > b, x > 0. We construct an integral solution x^"* 
such that j4[s]x^"* > b and c'^x^^* < Wjc'^x. Let Smax and Smin be the largest and smallest Sj's. 

Grouping: Let := {j : 2-^^+'^hmax < sj < 2~^Smax} for t = 0, 1, . . . T where T = 
•= Sjecw^ii^i^i- Note that ^^=0^* > bi- Let m* := min^.gcW:A,,^o ^i^y ' 
that is, m* is the smallest non-zero entry of the ith row of A in the columns of C^^\ Note that 
> 2~(*+-^)smax- Let m, be the largest entry of row i. The assumption AijSj < bi implies 



m. 



m < bi. 

Scaling: Let y* be a vector with y*- = lOxj for j S C^^\ elsewhere. Note that c'^y* = lOc-^x 
and yj < lOdj for any i. Let s* be a vector with s*- = 2^(*+^)smaa; for j € C''*^ otherwise. Since 
for all j € C^*\ s*- > Sj/2, for all rows i we have 



Therefore since m* > 2 '-^^^'^-'smaxj we get 

. / 56* 56* 56* 

2^ - 2-(*+i)s„,, - m* - m* 



If we define an integral vector a* to be a* := [^J, we see that Ay^ > a*. Using assumption [U 

i 

there exists an integral solution z* such that Az^ > a*, and c^z* < 7(c"^y*), and z* < lOdj. 
Scaling back: Now fix a row i, and look at 

56* 

where the first inequality follows since m* is the minimum entry in the ith row in the columns 
of C*-*\ This is where our analysis slightly differs from the previous analyses of grouping and 
scaling, where instead of multiplying the RHS by m*, the RHS was multiplied by 2~*Smax- This 
subtle observation leads us to make a weaker assumption than the no-bottleneck assumption. 

Getting the final integral solution: 

Define x^^* := Y^J^qzK Note that c^x^^* = Xlt c^-^* < lYjt^^^V* = 107(c^x) and x^^* < lOd. 
Fix a row i and look at the ith entry of j4[s]x^^*. 
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Let 5i := {t : 56* < m*}. Note that 

the second inequahty following from Claim [3] below. This gives us 

T 

^ 6* > ^ 6* - 3mi/5 >bi- Sm^/S 

For t ^ Si, we have the floor in the inequality (jl5p at least 1. So we can use the relation 
[x\ > x/2 for X > 1. Thus, using rrii <bi, we have 

Ax^^* > ^ — > 56i/2 - 3mi/2 > h 

Claim 3. Yl't=o^i — 

Proof. Note that the non-zero m* decreases as t goes from to T. Also, for any t < t', we have 
mj > 2^^^^^'' Smax and m* < 2~* Smax- Thus, m* < m* • 2^^* Since the largest m* can be 

at most rrii, Ylt=o fn\ <mi+ mi-\- mi/2 + rrii/A + .... < 3mj. □ 

□ 



3 Priority line cover 

We first show that the integrality gap of the canonical linear programming relaxation of PLC is 
at least 3/2 and at most 2. Subsequently, we present an exact combinatorial algorithm for the 
problem. 

3.1 Canonical LP relaxation: Integrality gap 

We start with the canonical LP relaxation for PLC and its dual in Figured! 

min I ^ CjXj : x G i?f (Primal) max | ^ ye '■ y G i?f (Dual) 

je<S:j covers e eGi?:j covers e 

Figure 1: The PLC canonical LP relaxation and its dual. 

The following example shows that the integrality gap of (jPrimaip is at least 3/2. 

Example 1. Figure [1] shows a line of odd length k; odd numbered edges have demand 1, and 
even numbered edges have a demand of 2. Paths are shown as lines above the line graph, and are 
also numbered. Odd numbered paths have a supply of 2, and even numbered ones have a supply 
of 1. Dashed lines indicate edges spanned but not covered. All paths have cost 1. Note that a 
fractional solution is obtained by letting Xp = 2/3 for paths 2 and and Xp = 1/3 otherwise. 
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The cost of this solution is (A; + 3)/3, while the best integral solutions takes all odd-numbered 
paths, and has cost {k + l)/2. As k tends to oo, the ratio between the integral and fractional 
optimum tends to 3/2. As an aside, we found the above integrality gap instance by translating 
a known integrality-gap instance of the tree- augmentation problem in caterpillar graphs; see [8]. 

k+l. . 

k . . 

8. . 

7 . . 

6 . . 

5 ■ ■ 

4 . . 

3 . . 

1 . . 

1 2 3 4 5 6 7 8 k-2 k-1 k 

Figure 2: Integrality Gap for PLC 

We now show that the integrality gap of the canonical LP for PLC is bounded by 2. We describe 
a simple primal-dual algorithm that constructs a feasible line cover solution and a feasible dual 
solution, and the cost of the former is at most twice the value of the dual solution. 

The algorithm maintains a set of segments Q. Call an edge e unsatisfied if no segment in Q 
covers e. Let U be the set of unsatisfied edges. Initially Q is the empty set and U = E. We grow 
duals He on certain edges, as specified below. We let denote the edges with positive ye] we call 
such edges, positive edges. Initially is empty. Call a segment j tight if X^eGj j covers eVe ~ '^j- 
We use the terminology an edge e is larger than /, if tTe > ttj. 



Primal-Dual Algorithm 

1. While U is not empty do 

• Breaking ties arbitrarily, pick the largest edge e in U. 

• Increase ye till some segment becomes tight. Note that each such segment must 
contain e. Let ji{e) and jr(e) be the tight segments that have the smallest left- 
end-point and the largest right-end-point, respectively. Since e is chosen to be the 
largest uncovered edge, any unsatisfied edge contained in the two segments j/(e) or 
jV(e) is also covered. We say e is responsible for ji{e) and jr(e). 

Add ji{e),jr{e) to Q. Add e to E^. Remove all the unsatisfied edges contained in 
either ji{e) or jr{e) from U. 

2. Reverse Delete: Scan the segments j in Q in the reverse order in which they were 
added, and delete j if its deletion doesn't lead to uncovered edges. 

It is clear that the final set Q is feasible. It is also clear that y forms a feasible dual. The 
factor 2-approximation follows from the following lemma by a standard relaxed complementary 
slackness argument, and this finishes the proof of Theorem [3l 

Lemma 5. Any edge e € E^ is covered by at most two segments in Q. 

Proof. Suppose there is an edge e G E^ covered by three segments ji,j2 and j^. Observe that 
one of the segments, say j2, must be completely contained in ji U j^. Since j2 is not deleted 
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from Q, there must be an edge / € j2 such that j2 is the only segment in Q covering /. Since 
ji and ^3 don't cover /, but one of them, say ji contains it, this imphes vrj > sup^^ > vrg. That 
is, / is larger than e. 

If / is the edge responsible for j2, then since j2 contains e, e wouldn't be in E+. Since / 
is larger than e, there must be a segment j in Q added before j2 that covers /. In the reverse 
delete order, j2 is processed before j. This contradicts that j2 is the only segment in Q covering 
/• □ 

Lemma 6. EjeQ^i < '^Y^eGEVe- 

Proof. Since each s e Q satisfies Y^eej-.j covers eVe = ^j, we get 

^ = ^ J2 ye = ^ ye • |{j G Q : j covers e}\ < 2 ^ 

j€Q j&Q^&j'-j covers e eGE eeE 

□ 

3.2 An Exact Algorithm for PLC 

We first describe the sketch of the algorithm; the full proof starts from Section [3?2Tl A segment 
j covers only a subset of edges it contains. We call a contiguous interval of edges covered by 
j, a valley of j. The uncovered edges form mountains. Thus a segment can be thought of as 
forming a series of valleys and mountains. 

Given a solution C 5 to the PLC (or even a PTC) instance, we say that segment j € S" 
is needed for edge e if j is the unique segment in S that covers e. We let Esj be the set of 
edges that need segment j. We say a solution is valley-minimal if it satisfies the following two 
properties: (a) If a segment j is needed for edge e that lies in the valley v of j, then no higher 
supply segment of S intersects this valley v, and (b) every segment j is needed for its last and 
first edges. We show that an optimum solution can be assumed to be valley-minimal, and thus 
it suffices to find the minimum cost valley-minimal solution. 

The crucial observation follows from properties (a) and (b) above. The valley-minimality of 
solution S implies that there is a unique segment j € S that covers the first edge of the line. 
At a very high level, we may now use j to decompose the given instance into a set of smaller 
instances. For this we first observe that each of the remaining segments in S'\ {j} is either fully 
contained in the strict interior of segment j, or it is disjoint from j, and lies to the right of it. 
The set of all segments that are disjoint from j form a feasible solution for the smaller PLC 
instance induced by the portion of the original line instance to the right of j. On the other 
hand, we show how to reduce the problem of finding an optimal solution for the part of the line 
contained in j to a single shortest-path computation in an auxiliary digraph. Each of the arcs 
in this digraph once again corresponds to a smaller sub-instance of the original PLC instance, 
and its cost is that of its optimal solution. The algorithm follows by dynamic programming. 

3.2.1 Valley-Minimal Solutions 

As mentioned above, it helps to think of supplies and demands as heights. In the case of PLC, 
the demands of the edges in E form a terrain, and each segment j & S corresponds to a straight 
line at height Sj. Segment j then covers edge e if e lies in the segment's shadow, that is, the 
height of e is smaller than the height of the segment. 
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Figure 3: The figure shows a segment j, and the terrain induced by the edges of E that it 
contains. The terrain partitions j into valleys and mountains. Valleys are indicated by solid 
parts of j, and mountains are shown as dashed lines. 

Figure [3] illustrates this with path P and its edges. The light gray terrain indicates the 
demands of the edges. The segment j shown in the picture covers the edges in [l,r] that lie in 
its shadow; e.g., j covers edge e but not e' . The terrain partitions j naturally into valleys - 
contiguous sub- intervals of [I, r] that are in the shadow of j, and mountains - those sub- intervals 
that are contained in [l,r] and consist entirely of edges that are not covered by j. The parts of 
j that correspond to mountains are indicated by dashed lines, and valleys are depicted by solid 
lines. In the following, we let [l'l,rl.] be the interval corresponding to the kth valley of j. 

In the following, we will assume that the set of segments S in the given PLC /PTC instance is 
segment- complete; i.e., if S contains the segment j then it also contains all proper sub-segments. 
For example, if a PLC instance contains segment j corresponding to interval [P, r-'], then it also 
contains segments corresponding to intervals [l,r] for all F < I < r < . This assumption is 
w.l.o.g. as we can always add a dummy sub-segment j' for any such interval [I, r] with the same 
supply and cost as j. Any minimal solution clearly uses at most one of j and j' , and if j' is 
used, then replacing it with j does not affect feasibility. 

Let 5 C 5 be an inclusion-wise minimal solution for the given instance, and let j G 5" be 
any one of its segments. We say that j is needed for edge e € E ii j covers e, and if there is no 
other segment in S that covers e; let Esj be the set of edges that need j, and hence Esj ^ 
for all j G S. Thus, if j is needed for e, then e is in one of j's valleys; we let valg be that valley. 

A solution C 5 is valley-minimal if 

[Ml] for all j (z S and for all e € Esj, no segment of higher supply in S covers any of the edges 
in val^, and 

[M2] each segment is needed for its first and last edge. 

We obtain the following observation. 

Lemma 7. Given a feasible instance of PLC/PTC, there exists an optimum feasible solution 
that is valley-minimal. 
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Proof. First, it is not too hard to see that we can always obtain an optimal solution that satisfies 
[M2]. If S is an optimum solution with a segment j, and j is not needed for its first or last edge 
e, then we may clearly replace j by the sub-segment j — e. This does not increase the solutions 
cost, using the segment-completeness. 

Assume, for the sake of contradiction that S violates [Ml]. For a solution S C S, say that 
{j,j',e) is a violating triple if € S, j' has higher supply than j, j is needed for e, and / 
covers some edge in val;^. Choose a solution S with the smallest number of violating triples 
and let {j,j',e) be one such triple. Since j is needed for e, edge e is not contained in j', and 
hence j' is either fully contained in the interval (e,n] or fully contained in the interval [l,e). 
Using the segment-completeness assumption, we may replace j' by the sub-segment j" obtained 
by removing the prefix consisting of edges in val^; remove j" if it is empty. The resulting 
set of segments has cost at most that of S, and the number of violating triples is smaller; a 
contradiction. □ 

In the next subsection, we show how we can compute the minimum cost valley-minimal 
solution for PLC instances in polynomial time using dynamic programming. 



3.2.2 Computing valley-minimal solutions 

Given 1 < / < r < n, we obtain the sub-instance induced by interval [/, r] by restricting the line 
[1, n] to this interval, and by keeping only segments that are fully contained in [l,r]. Observe that 
the valley-completeness assumption implies that any such sub-instance is feasible. We begin by 
making a crucial observation that will allow us to decompose a given PLC instance into smaller 
instances. Let be a valley-minimal solution for the sub-instance induced by [l,r], and note 
that [M2] implies that S contains a unique segment j that covers the first edge {1,1 + 1). Suppose 
that Esj = {ei, . . . ,ek} is the set of edges within [l,r] that need segment j. Abusing notation 
slightly, we let val] = [^i,?^^] be the valley of j around edge e^; thus we clearly have 

£;5j C val{ U . . . Uval^. (16) 

Note that segment j may have valleys that entirely consist of edges that do not need j; accord- 
ingly, such valleys are not part of the list on the right-hand side of (jl6p . Using property [M2], 
however, we may assume that val{ and val|. are the first and last valley, respectively, of segment 
j. We obtain the following observation, where we let Z^^^ = r + 1. 

Observation 1. We may assume, for all 1 < i < k, if j' G S contains e E {rl,ll^i), then j' is 
fully contained in (rlJl^-^). 

Proof. Consider first a segment j' € S with supply bigger than sj. In this case [Ml] implies that 
j' must have an empty intersection with the valleys valj, . . . , val;^, and the observation follows. 

On the other hand if segment j' has supply at most Sj, then since j' must be needed for 
some edge e, j must not contain e implying j' must have its right end-point in (r^, r]. Replacing 
j' by its intersection with (r^,r] completes the observation. □ 

We now let OPT/^^ be a minimum cost valley-minimal feasible solution for the sub-instance 
induced by interval [l,r], and we let opt^ ^ be its cost. Clearly, OPT„^,i consists of the minimum 
cost segment in S that covers edge n, and OPTi^„ is the optimum solution we want to obtain. 
Suppose that we know OPT;/ ,.' for all I < I' < r' < r. The high level idea is the following. The 
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Figure 4: The part of digraph Gi corresponding to segment j (z Si. 

algorithm guesses the first segment j in OPT; Suppose that r' < r is the rightmost edge covered 
by j. Observation [1] allows us to partition the remaining segments in OPT; ,, into two parts: 

Part 1 Segments that contain edges in (r',r]. None of these segments can contain any of the 
edges in [l,r'] by the observation. 

Part 2 Segments that contain edges in [l,r']. Once again, the observation implies that such 
segments must be fully contained in {l,r'). 

The first part's solution is obtained since it is a smaller subproblem, the second part is obtained 
via a shortest-path computation. We now elaborate and give the complete algorithm. 

Let Si be the segments in S with leftmost endpoint /. We construct a digraph Gi as follows. 
Consider a segment j £ Si, and let 

be the set of its valleys. We add a node Vq for each valley 1 < g < /c of j to G^. We also add an 
arc {vq,v-j^,) for all 1 < q < q' < k. A shortest path corresponding to the solution OPT; ^ will use 

arc if 

(i) j is the leftmost segment in OPT;^,., and 

(ii) val^ and val^, are two consecutive valleys of j that contain edges that need j. 

Observation [T] then states that OPT;^^ uses segments that are entirely contained in (rg,/^,) to 
cover (ri, V ,). An optimum set of such segments is given by OPT j , ,j _, , and we therefore give 

arc (vLvL) cost opt j , ,j _, . Figure H] shows the part of Gi for the segment s from Figure El 

We add a source node s; and arcs {si,vl) of cost Cj for each of the segments j G Si. A shortest 
path uses such an arc if j is the unique segment starting at I in the corresponding optimum 
solution. We also add a sink node tr and add an arc {vl,tr) for all j G 5; of cost opt j , 

k ' 

indicating the optimum PLC for the sub-interval [r^ + l,r]. Note that if = r, then this arc 
is a loop of cost and can be discarded. 

It follows from the above construction that opt; is equal to the cost of a shortest s;,tr.-path 
in G;. Each of the shortest-path computations can clearly be done in polynomial time, and 
hence opt; ^ can be obtained via dynamic programming, in polynomial time. This yields the 
following restatement of Theorem \5\ 
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Theorem 10. The cost opt^ ^ of an optimum solution for a given PLC instance can be computed 
in polynomial time. 

4 Priority tree cover 

We first give a proof of Theorem [6l and show that rooted PTC is APX-hard, even if all segments 
have unit cost. Subsequently, we present a 2- approximation algorithm for the problem, by 
reducing it to an auxiliary instance of the tree augmentation problem. Then, we prove Theorem 
m and show that the integrality gap of the canonical LP formulation of unweighted PTC is 
bounded by 6. Finally, we prove the connection between PTC and the rectangle cover problem. 

4.1 APX-hardness 

We prove APX-hardness of PTC via a reduction from the minimum vertex cover problem in 
bounded degree graphs. The latter problem is known to be APX-hard [3j. Given a bounded 
degree graph G{V, E), with n vertices and m = 0{n) edges, let the edges be arbitrarily numbered 
{1,2,... ,m}. 

The tree in our instance has a broom structure: it has a handle which is a path of m 
edges (ei, . . . ,em) given by vertices {xo,xi, . . . ,Xm}, and it has n bristles where each bristle 
corresponds to a particular vertex v £ V and is a path of length deg{v). The edge in the handle 
for 1 < i < m, corresponds to the edge numbered i in the graph G. The bristle corresponding 
to vertex u is a path {ff, /|, . . . , fdeg(v)) S^^^^ ^he vertices {xm, y\,yl,..., Vdegiv)}- The root 
of the tree is xq, the end point of the handle. Thus the tree has m + deg{v) = 3m edges. 

We now describe the priority demands of these tree edges. The demand of edge Cj is i. 
Consider the edges in G incident on v in the decreasing order of their numbers. Suppose they 
are (ii > ^2 > • • • > ideg{v))- The demands of the edge fj is ij. Thus, for a particular bristle 
corresponding to a vertex v, the demands decrease as we go from /{" to f^^gi^^y and these demands 
correspond to the numbers of edges incident on v. 

Now we describe the segments. All segments have unit cost. We have two kinds of segments: 
edge segments and vertex segments. For every edge i = (f , w) in E, there are two edge segments 

and s^. Segments sj, contains all edges Cj to Cm and edges fl to /J, where edge i is the jth 
edge in the descending order of neighbors of v in G. The supply of segment s\, is i, and thus by 
construction, we see that only spans edge Cj and /J. That completes the description of edge 
segments. For every vertex v, there is a vertex segment t^ that covers all the edges in the bristle 
corresponding to vertex v. That completes the description of the PTC instance. Look at figure 
[5] for an illustration of the reduction. 

The following lemma along with the APX-hardness of the vertex cover problem in bounded 
degree graphs, and the fact that in the latter any vertex cover is of size Vl{n), leads to the 
APX-hardness of the PTC problem. 

Lemma 8. The optimum PTC of the above instance is m + k, where k is the size of the optimum 
vertex cover of G. 

Proof. Firstly note that we may assume that in any optimal PTC, for any edge i = {v,w), we 
will have exactly one of s^ or sjj, in the solution. We need to have one since these are the only 
two segments that cover edge et in the tree. Instead of picking both, we can remove one, say 
s^, from the solution and pick the corresponding vertex segment t^ instead, at no increase of 
cost. Therefore, there are exactly m edge segments picked in any optimal PTC solution. 
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(i) (ii) 

Figure 5: (i) shows an instance of the vertex cover problem, and (ii) is the corresponding PTC 
instance. The numbers on the edges are the priority demands corresponding to the edge numbers 
in the graph. Figure (ii) also shows two segments: and s^, having supplies 1 and 3 respectively. 
Dashed line means that these segments do not have enough supply to cover the edges. 

Now note that these m edge segments uniquely correspond to an orientation of the edges 
in G; if for edge i = {v,w), s\, is chosen in the solution, the edge {v,w) is oriented from w to 
V. In this orientation, if there is a sink (a vertex with all edges incident to it) f , then note 
that all the edges in the bristle corresponding to v have also been covered. Thus, the number 
of vertex segments required to cover the remaining edges of the tree, is precisely the number 
of non-sinks in this orientation. In particular, the optimal PTC corresponds to the orientation 
that minimizes the number of non-sinks. 

The proof is complete by noting that non-sinks form a vertex cover; this is because each edge 
is oriented away from some non-sink, and is thus incident to it. Furthermore, given a vertex 
cover, there exists an orientation with precisely these vertices as non-sinks. Orient the edges 
towards the complement of the vertex cover (the independent set) - the complement is precisely 
the set of sinks, and thus the vertex cover is precisely the set of non-sinks. □ 

Proof of Theorem\^ Suppose the degrees of G are all a constant. Note that the vertex cover 
of this graph is at least m/B = n/2. The APX-hardness implies that it is NP-hard to distinguish 
between the case when the vertex cover is cin or C2n where C2 > ci > 1/2 are certain constants. 

The above lemma therefore implies it is NP-hard to distinguish between the cases when the 
optimum of a PTC is m + cin = (ci + B/2)n and when the optimum is m + C2n = (c2 + B/2)n. 
Since B,ci,C2 are constants, we get the APX-hardness. 

(For the interested reader: the APX-hardness of vertex cover of bounded degree graphs by 
Berman and Karpinski [3] gives B = 4, ci = 78/152 and C2 = 79/152, showing it is NP-hard to 
approximate to a factor better than 1.002.) □ 

4.2 An approximation algorithm for PTC 

The crucial idea is the following. Given an optimum solution S* C S, we can partition the 
edge-set E oiT into disjoint sets Ei, . . . , Ep, and partition two copies of S* into Si, ... , Sp, such 
that Ei is a path in T for each i, and Si is a priority line cover for the path Ei. Once again, we 
assume without loss of generality that the instance is segment-complete. 

In particular, we prove the following lemma. Let Es*j be the set of edges e such that j 
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is the segment with the highest supply, among ah segments in S* that cover e. Note that the 
union of all Es*j, over all j G S*, partitions E. Also note that for each edge e, there is a unique 
segment j such that e G Es*j. If there were two, we could replace one of the segments by a 
sub-segment and still stay feasible. We call the segment j responsible for e. 

Lemma 9. Given an optimal solution S* Q S to a PTC instance with tree T = (y,E), there is 
a partition 

EiU ...UEp = E, 

where each Ei is the edge set of a path in T such that for all j €z S* , Es* j DEi ^ for at most 
two i G {1, . . . 

Using this, we describe the 2-approximation algorithm which proves Theorem [71 

Proof of Theorem For any two vertices t (top) and b (bottom) of the tree T, such that t 
is an ancestor of b, let Ptb be the unique path from b to t. Note that Ptb, together with the 
restrictions of the segments in Sto Ptb, defines an instance of PLC. Therefore, for each pair t 
and b, we can compute the optimal solution to the corresponding PLC instance; let the cost 
of this solution be c^^. Create an instance of the 0,1-tree cover problem with T and segments 
S' := {{t, b) : t is an ancestor of b} with costs c^^. Solve the 0,1-tree cover instance exactly (recall 
we are in the rooted version) and for the segments (t, b) in S' returned, return the solution of 
the corresponding PLC instance of cost c^^. We now use Lemma [9] to obtain a solution to the 
0,1-tree cover problem (T, S') of cost at most 2 times the cost of S* . This will prove the theorem. 

For each Ei, let ti and bi be the end points of Ei with ti being the ancestor of bi. Since 
E'j's partition the edges, the segments {ti,bi) : i = 1, ... ,p is a feasible 0,1-tree cover for (T, S'). 
Define Si := {j G S* : e £ Ei n Es*j} to be the set of segments responsible for the edges in 
Ei. By definition. Si is a PLC for Ei. Thus, the cost of the segments in Si is at least c^,;,,. 
Furthermore, Lemma [9] implies that the total cost of the segments in Si is at most twice the cost 
of segments in S*. Therefore, the cost of the feasible solution to the cover problem in (T,S') is 
at most twice the cost of segments in S*. 

□ 

Proof of Lemma [21 We give an algorithm to compute the decomposition. Let e be any of the 
edges incident to the root of T, and let ji € S* be the highest-supply segment covering e. We 
then let Ei be the edges of the path in T corresponding to ji. Removing Ei from T yields 
sub-trees Ti, . . . ,Tq. For each tree Ti we repeat the above steps, and let 

Ei,...,Ep (17) 

be the final partition; let ji € S* be the segment corresponding to edge-set Ei. Note that for 
q < q' , Es*jqf^Egi is empty. This is because E^i is a subset of edges which are not in . . . 

Consider a segment j € 5, and let 1 < i < p be smallest such that Es*j H Ei ^ 0, and 
assume that Es* j DEg ^ for some i < q < p; choose q smallest with this property. We claim 
that jq = j, and hence for all q < q' < p we have Es*,j H Egi = 0. Thus, Es*j has non-empty 
intersection only with Ei and Eg. 

Let e G Es^jCiEi, and let / G Es*jriEq be two edges in different parts of the partition such 
that j is responsible for both. As both e and / are edges on j, and since i < q, it follows that 
/ is a descendant of e in tree T. Let g be the topmost edge of Eg] clearly, g is on the e, /-path 
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in T. By the decomposition algorithm, segment jq is the highest-supply segment covering edge 
g. As j contains g, this means that the supply of jq is at least that of j. Finally, since / is on 
jq, jq covers / as well. But this means that jg = j as j is responsible for /. □ 

4.3 Canonical LP relaxation of PTC: Integrality Gap 

In this section, we prove Theorem [H by showing that the canonical LP relaxation of unweighted 
PTC is at most 6. Recah the PTC LP. 



mm 



CqXo 



> 1; Xs>0,yseS 



(18) 



s:s covers e 



Proof of Theorem The idea of the proof is the following: as in the factor 2-approximation for 
PTC, we decompose the edge set of the tree into disjoint sets Ei, - ■ ■ ,Ep, such that each Ei 
induces a path. We will abuse notation and refer to the i?j's as paths. Furthermore, we take 
any feasible solution x of (jlSp and obtain p fractional solutions x^^^ , ■ ■ ■ , x^P'^ such that x^*) is a 
feasible fractional solution to (jPrimaip for the PLC instance on the path E^. We will guarantee 
that 



The theorem then follows from Theorem [3l 

Unlike in the argument used in the previous sec- 
tion where the decomposition into paths depended 
on S*, the decomposition into disjoint paths that 
we use here is universal. Each path Ei will end at 
a unique leaf, and p in (jl7p will now be the number 
of leaves of T. Let Ei be any path from the root 
to a leaf. Delete Ei from the tree to get a series of 
sub-trees. Recursively, obtain E2 to Ep. We call a 
path Ei a child of Eg, if the starting point of Ei lies 
on Eg. 

Let X be any feasible fractional solution of ()18p 
and let S* be the support of x, that is, S* = {j : 
Xj > 0}. Fix a path Ei and say that a segment 
j G S* intersects Ei if j covers an edge in Ei A 
segment j that intersects Ei is called local for Ei if 
either the first or the last edge covered by j lies in 
Ei. A segment j that intersects Ei is called global 
for Ei, otherwise. Figure [6] illustrates this. 

Let j be a global segment for Ei, and let e be 
the first edge contained in j after Ei. Ii e G E, 



9' 




Figure 6: The figure shows a fragment 
Ei, its parent Er, and two children Eg 



we call s an iq-global segment. Observe that Eg is 
a child of Ei. Thus an ig-global segment enters Ei 
and exits via Eg. Note that zg-global segments, over 
all q such that Eg is a child of Ei, partition all global segments for E, 
segment could also be a z'g'-global segment for some other i' , q'. 



and Et. The segments ji,j2, and J3 are 
local for Ei, and segment j4 is global. In 
particular, ^4 is an i, t- global segment. 



Also note that an ig-global 
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Now we are ready to define the fractional solution x*-^ that will be feasible for (jPrimaip for 

(i) 

the PLC instance on Ei. Firstly for all segments j that are local for Ei, let Xj = Xj. Next, 
we take care of segments that are global for Ei. For each child Eg of Ei, order all the ig-global 
segments in non-increasing order of supply: {ji, . . . ,jr}- Let / be such that 

Xj, + ---+ Xj^ < 1 and Xj, + ---+ Xj^ + xj^^^ > 1 

(i) (i) 

If no such / exists, then / = r. Define Xj^ = xj^, for 1 < /c < Z. If I < r, then let Xj^^_^ = 



3k ' 



1 - Ei=l X 

Claim 4. x^*) is feasible for (jPrimaip for the PLC instance on Ei. 

Proof. Pick any edge e G Ei. Look at all segments j € S* that cover e. These segments are 
either local for e or global for e. If j is local, there is a corresponding segment in the support 
x^*^ of the same value. Furthermore for any q, 

^ xf^ > min{l, ^ Xj} 

j:j is ig- global, j covers e j-.j is ig-global.j covers e 

In any case, e is covered by x^*^ at least to the extent it is covered by x, which implies x^*^ is 
feasible. □ 



Lemma 10. ^Li EjeS < 3 Y.jeS 



Xa 



Proof. Each segment j G S* is local for at most two paths Ei and Eg. Thus the contribution to 
the LHS by local segments for some path Ei is exactly 2 YljeS 

Furthermore, for every parent-child pair Ei and Eq that induces an if^-global segment for Ei, 
we increase the LHS by at most 1. The number of such pairs is at most the number of leaves in 
T. The proof is complete by noting that X^jg^ Xj is at least the number of leaves in T. □ 

To complete the proof of the theorem, note that from Theorem [3] we know there exists for 
each Ei, a set of segments Si such that \Si\ < 2 x^j \ The union of all such Si forms a valid 
PTC of cardinality at most 6 Xj. □ 

4.4 Priority Tree Cover and Geometric Covering Problems 

In this section, we show that the PTC problem is a special case of covering a set of points 
in 3-dimension by axis-parallel rectangles (cuboids). In particular we prove Theorem [HI We 
go in two steps. We first define a problem, that we call 2-Priority Line Cover and show that 
the PTC problem is a special case of 2-PLC. Subsequently, we show 2-PLC is a special case of 
3-dimensional rectangle cover. We start with a definition of 2-PLC. 

2-Priority Line Cover (2-PLC). The input is a line T = (V, E), and a collection of segments 
5 C y X y with costs Cj for each j € S. Furthermore, each segment j has a priority supply 
vector in two dimensions, denoted as (sj,s^), and each edge e has a priority demand vector in 
two dimensions, denoted as (7rg,7rg). A segment j covers e iff j contains e and s*- > Tr* for both 
i = 1,2. The goal is to find the minimum cost collection of segments that cover every edge. 

It is easy to see that PLC is a special case of 2-PLC. Somewhat surprisingly, PTC is a special 
case of 2-PLC as well. 
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Lemma 11. Any instance of PTC can be encoded as an instance of 2-PLC with the same 
solution set. 

Proof. Given a rooted tree T = {V, E), we perform two different depth first traversals to get two 
different orderings on the edges E. One such ordering will define the line of the 2-PLC instance, 
the other will define the first coordinates of the priority demand vectors of the edges. 

In a depth first traversal of a tree, at every step we move from a vertex to one of its children, 
if any. Our two different traversals will be defined by two different choices of moving to a 
child-vertex. For every vertex v of the tree, consider a total order av on its children. One such 
order that is convenient to keep in mind is the following; given a drawing of the tree, the total 
order of the children is from left to right. Let be the opposite total order. The two depth 
first traversals are obtained by running with a^s and fx^'s, respectively. Figure [7] illustrates the 
two orders with the ordering at every vertex v being from left-to-right, and being from 
right-to- left. 




(8.TT(a)yi0.TT(dy(9.n(e)y4,n(b)y(5,TT(f)y(7.TT(i)y(6.TT(i)yi.TT(c)y(3.TT(g)y2,n(h))^ 
adebfijcgh 



Figure 7: The left most tree is the original tree, the second and third are the two depth first 
traversals. The line below shows the line in the 2-PLC instance. 

Let the two traversals return orderings and /i^ on the edges of the tree. The crucial 
observation is the following: for any vertex v, let {vi, . . . ,Vk) be the children in the av order; 
then fj,{v, vi) < fj,{v, V2) < ■ ■ ■ < fJ-{v, Vk), and thus, fJ.^{v, vi) > ■ ■ ■ > fi^{v, Vk). 

Now we are ready to describe the 2-PLC instance. The line is defined by the edges of the 
tree ordered w.r.t. /i. That is, the order of the edges is (ei, . . . , em) such that /i(ei) < /u(e2) < 
••• < iJ.{em)- The priority demand vector of an edge e of the tree is (/i'^(e), vrg). Consider a 
segment j = (u, v) such that u is a descendant of v in the PTC instance. We identify two 
specific tree edges contained in j: the parent-edge {u,u') of u, and the edge {v,v') between 
node V and its unique child v' that is on the u, f-path in T. By the depth-first property, we 
get ii{v,v') < fi{u,u'). The corresponding segment in the 2-PLC instance, also denoted as j, 
contains all the edges from fj,{v,v') to /u(n, n'). The priority supply vector of j is (/u^(n, n'), Sj). 

Claim 5. For any segment j, the set of edges covered by j in the 2-PLC instance is precisely 
the set of edges covered in the PTC instance. 
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Proof. Let e be an edge covered by j in the PTC instance. Since e is contained in the path from 
n to f in the tree, by property of depth first traversals we get, fi{v,v') < fi{e) < fj,{u,u') and 
lJL^{e) < fi^{u,u'). The first pair of inequahties imphes e hes in the segment j in the 2-PLC 
instance, the second impfies that ttI < sj. Since e is covered by j in the PTC, we also get 
TTg = TTe < Sj = s|. Thus, 6 is covcrcd by j in the 2-PLC instance. 

Let e be an edge covered by j in the 2-PLC instance. Since e hes in j, we conclude fi{v, v') < 
A*(e) < fj,{u,u'). This implies either (a) e lies on the path from u to f in the tree, or, (b) there 
is a node w on the u, v-path in the tree, and a child z oi w that is not on this path such that e 
is contained in the subtree defined by edge {w,z). 

Note, that in case (b) the depth-first traversal for order a visits edge (z, w) before edge 
{u,u'). This implies that the second dfs traversal for order visits {z,w) after {u,u'). Since 
{z,w) is visited before e in both traversals, we must therefore have /^^(e) > fj,^{u,u'), and this 
implies < 7r^(e) which is impossible since j covers e. Thus, case (b) is not possible, and e lies 
on the path fro u to f on the tree. Furthermore, we have Sj = > vTg = vTe, and so j covers e 
in the PTC instance as well. □ 

□ 

Now we show that 2-PLC is a special case of 3-dimensional rectangle cover. This is not to 
hard to see. We assume the edges of the line are numbered (1,2,..., m). For edge e numbered 
ej, we associate a point in 3 dimensions with coordinates {i, vTg, TTg). For each segment j = (a, 6), 
we have a rectangle associated. In fact, these rectangles have are unbounded in the negative y 
and z coordinates. The other 4 bounding half-spaces are x > a, x < b, y < s^{j) and z < 
It is not too hard to see a rectangle corresponding to a segment j contains a point corresponding 
to an edge e iff j covers e in the 2-PLC instance. This completes the proof of Theorem [8j 

5 Concluding Remarks 

In this paper we studied column restricted covering integer programs. In particular, we studied 
the relationship between CCIPs and the underlying 0,1-CIPs. We conjecture that the approx- 
imability of a CCIP should be asymptotically within a constant factor of the integrality gap of 
the original 0,1-CIP. We couldn't show this; however, if the integrality gap of a PCIP is shown 
to be within a constant of the integrality gap of the 0,1-CIP, then we will be done. At this 
point, we don't even know how to prove that PCIPs of special 0,1-CIPS, those whose constraint 
matrices are totally unimodular, have constant integrality gap. Resolving the case of PTC is an 
important step in this direction, and hopefully in resolving our conjecture regarding CCIPs. 
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